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MSPSPTALFCL 11 

GGAGTCGACCCACGCGTCCGCAGGGCTGAGGAACC ATG TCT CCA TCC CCG ACC GCC CTC TTC TGT CTT 68 

GLCLGRVPAQSGPLPKPSLQ 31 

GGG CTG TGT CTG GGG CGT GTG CCA GCG CAG AGT GGA CCG CTC CCC AAG CCC TCC CTC CAG 128 

ALPSSLVPLEKPVTLRCQGP 51 

GCT CTG CCC AGC TCC CTG GTG CCC CTG GAG AAG CCA GTG ACC CTC CGG TGC CAG GGA CCT 188 

PGVDLYRLEK.LSSSRYQDQA'71 

CCG GGC GTG GAC CTG TAC CGC CTG GAG AAG CTG AGT TCC AGC AGG TAC CAG GAT CAG GCA 248 

VLFIPAMKRSLAGRYRCSYQ 91 

GTC CTC TTC ATC CCG GCC ATG AAG AGA AGT CTG GCT GGA CGC TAC CGC TGC TCC TAC CAG 308 

NGSLWSLPSDQLELVATGVF 111 

AAC GGA AGC CTC TGG TCC CTG CCC AGC GAC CAG CTG GAG CTC GTT GCC ACG GGA GTT TTT 368 

AKPSLSAQPGPAVSSGGDVT 131 

GCC AAA CCC TCG CTC TCA GCC CAG CCC GGC CCG GCG GTG TCG TCA GGA GGG GAC GTA ACC 428 

LQCQTRYGFDQFALYKEGDP 151 

CTA CAG TGT CAG ACT CGG TAT GGC TTT GAC CAA TTT GCT CTG TAC AAG GAA GGG GAC CCT 488 

APYKNPERWYRASFPI I T V T 171 

GCG CCC TAC AAG AAT CCC GAG AGA TGG TAC CGG GCT AGT TTC CCC ATC ATC ACG GTG ACC 548 

AAHSGTYRCYSFSSRDPYLW 191 

GCC GCC CAC AGC GGA ACC TAC CGA TGC TAC AGC TTC TCC AGC AGG GAC CCA TAC CTG TGG 608 

SAPSDPlE LVVTGTSVTPSR 211 

TCG GCC CCC AGC GAC CCC CTG GAG CTT GTG GTC ACA GGA ACC TCT GTG ACC CCC AGC CGG 668 

LPTEPPSSVAEFSEATAELT 231 

TTA CCA ACA GAA CCA CCT TCC TCG GTA GCA GAA TTC TCA GAA GCC ACC GCT GAA CTG ACC 728 
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LPPLPQTRKSHGGQDGGRQD 331 
CTG CCG CCC CTC CCG CAG ACC CGG AAA TCA CAC GGG GGT CAG GAT GGA GGC CGA CAG GAT 1028 



CCGCTGAACCCCAGGCACGGTCGTATCCAAGGGAGGGATCATGGCATGGGAGGCGACTCAAAGACTGGCGTGTGTGGAG 1 134 
CGTGGAAGCAGGAGGGCAGAGGCTACAGCTGTGGAAACGAGGCCATGCTGCCTCCTCCTGGTGTTCCATCAGGGAGCCG 1213 
TTCGGCCAGTGTCTGTCTGTCTGTCTGCCTCTCTGTCTGAGGGCACCCTCCATTTGGGATGGAAGGAATCTGTGGAGAC 1292 
CCCATCCTCCTCCCTGCACACTGTGGATGACATGGTACCCTGGCTGGACCACATACTGGCCTCTTTCTTCAACCTCTCT 1371 
MTATGGGCTCCAGACGGATCTCTMGGTTCCCAGCTCTCAGGGTTGACTCTGTTCCATCCTCTGTGCAAAATCCTCCT 1450 
GTGCTTCCCTTTGGCCCTCTGTGCTCTTGTCTGGTTTTCCCCAGAAACTCTCACCCTCACTCCATCTCCCACTGCGGTC 1529 
TMCAMTCTCCmCGTCTCTCAGMCGGGTCTTGCAGGCAGmGGGTATGTCATTCATmCCTTAGTGTAAAACT 1608 
AGCACGTTGCCCGC1TTCCCTTCACATTAGAAMCMGATCAGCCTGTGCMCATGGTGAAACCTCATCTCTACCAACAA 1687 
MCAAAAAMCACAAAAATTAGCCAGGTGTGGTGGTGCATCCCTATACTCCCAGCAACTCGGGGGGCTGAGGTGGGAGA 1766 
ATGGCTTGAGCCTGGGAGGCAGAGGTTGCAGTGAGCTGAGATCACACCACTGCACTCTAGCTCGGGTGACGAAGCCTGA 1845 
CCTTGTCTCAAAAMTACAGGGATGMTATGTCMTTACCCTGATTTGATCATAGCACGTTGTATACATGTACTGCAAT 1924 
ATTCCTGTCCACCCCATAMTATGTACMTTATGTATACATTTT^^ 2003 
AAAAAAAAAAAAAAGGGCGGGCCGCTAGACTAGTCTAGAGAACA 2047 



VHSRGLCS* 
GTT CAC AGC CGC GGG TTA TGT TCA TGA 
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MSPSPTALFCLGLCLGRVPAQSGPLPKPSLQALPSSLVPLEKPVTLRCQGPPGVDLYRLE 
KLSSSRYQDQAVLFIPAMKRSLAGRYRCSYQNGSLWSLPSDQLELVATGVFAKPSLSAQP 
GPAVSSGGDVTLQCQTRYGFDQFALYKEGDPAPYKNPERWYRASFP 1 1 TVTAAHSGTYRC 
YSFSSRDPYLWSAPSDPLELWTGTSVTPSRLPTEPPSSVAEFSEATAELTVSFTNKVFT 
TETSRS I TTSPKESDSPAGPARQYYTKGNLVR ICLGAVI L 1 1 LAGFLAEDWHSRRKRLRH 
RGRAVQRPLPPLPPLPQTRKSHGGQDGGRQDVHSRGLCS 
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10 20 30 40 50 60 70 

inputs ATGACGCCCGCCCTCACAGCCCTGCTCTGCCTTGGGCTGAGTCTGGGCCCCAGGACCCGCGTGCAGGCAG 



ATGTCTCCATCCCCGACCGCCCTCTTCTGTCTTGGGCTGTGTCTGGGGCG-TGTGCCAGC - - GCAGAGTG 
10 20 30 40 50 60 

80 90 100 110 120 130 

i nputs GGCCCTTCCCCAAACCCACCCTCTGGGCTGAGCCAGGCTCTGTGAT- CAGCTGGGGGAGCCCCGTGACCA 

GACCGCTCCCCAAGCCCTCCCTCCAGGCTCTGCCCAGCTCCCTGGTGCCCCTGGAGAAGCCA-GTGACCC 
70 80 90 100 110 120 130 

140 150 160 170 180 190 200 

inputs TCTGGTGTCAGGGGAGCCTGGAGGCCCAGGAGTACCGACTGGATAAAGAGGGAAGCCCAGAGCCCTTGGA 

TCCGGTGCCAGGG - -ACCT CCGGGCGTG- - GACCTGTA CCGCCTGGAG AAG 

140 150 160 170 180 

210 220 230 240 250 260 270 

inputs CAGAAATAACCCACTGGAACCCAAGAACAAGGCCAGATTCTCCATCCCATCCATGACAGAGCACCATGCG 



CTGAGTT- -CCAGCAGGTACC-AGGATCA-GGCAGTCCTCTTCATCCCGGCCATGAAGAGAAGTCTGGCT 
190 200 210 220 230 240 

280 290 300 310 320 330 340 

i nputs GGGAGATACCGCTGCCACTATTACAGCTCTGCAG- -GCTGGTCAGAGCCCAGCGACCCCCTGGAGCTGGT 

GGACGCTACCGCTGCTCCTAC - - CAGAACGGAAGCCTCTGGTCCCTGCCCAGCGACCAGCTGGAGCTCGT 
250 260 270 280 290 300 310 

350 360 370 380 390 400 410 

inputs GATGACAGGATTCTACAACAAACCCACCCTCTCAGCCCTGCCCAGCCCTGTGGTGGCCTCAGGGGGGAAT 

TGCCACGGGAGTTTTTGCCAAACCCTCGCTCTCAGCCCAGCCCGGCCCGGCGGTGTCGTCAGGAGGGGAC 
320 330 340 350 360 370 380 

420 430 440 450 460 470 480 

inputs ATGACCCTCCGATGTGGCTCACAGAAGGGATATCACCATTTTGTTCTGATGAAGGAAGGAGAACACCAGC 



GTMCCCTACAGTG';.AGACTCGGTATGn':Tn-GACCAATTTGCTCTGTACAAGGAAGG 

3 9 ■ 'i 41.;: 430 440 

W} v 5i: 530 53^ 540 550 

TCCC:CGGA:;0' GG^G : OAOAGCAGO'OGACAG^GGGG 



GGACC07G C GCCCTA CAA 

45"i 460 



FIG. 3 A 
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560 570 580 590 600 610 620 

inputs GAACCCCAGCCACAGGTGGAGGTTCACATGCTATTACTATTATATGAACACCCCCCAGGTGTGGTCCCAC 



GAATCCCGA GAGATGGTAC - CGGGCTAGT TT CCCC AT CAT 

470 480 490 500 

630 640 650 660 670 680 690 

inputs CCCAGTGACCCCCTGGAGATTCTGCCCTCAGGCGTGTCTAGGAAGCCCTCCCTCCTGACCCTGCAGGGCC 



CACGGTGACCGCC GCCCACAG 

510 520 

700 710 720 730 740 750 760 

inputs CTGTCCTGGCCCCTGGGCAGAGCCTGACCCTCCAGTGTGGCTCTGATGTCGGCTACGACAGATTTGTTCT 



CGGAACCTA CCGATG - CTACAGC TTCT 

530 540 550 

770 780 790 800 810 820 830 

inputs GTATAAGGAGGGGGAACGTGACTTCCTCCAGCGCCCTGGCCAGCAGCCCCAGGCTGGGCTCTCCCAGGCC 



•CCAGCAG- 



840 850 860 870 880 890 900 

inputs AACTTCACCCTGGGCCCTGTGAGCCCCTCCCACGGGGGCCAGTACAGGTGCTATGGTGCACACAACCTCT 



-GGACCCA - - --- - TACCT-- 

560 

910 920 930 940 950 960 970 

inputs CCTCCGAGTGGTCGGCCCCCAGCGACCCCCTGAACATCCTGATGGCAGGACAGATCTATGACACCGTCTC 



GTGGTCGGCCCCCAGCGACCCCCTGGA GCT- TGTG 

570 580 590 600 

980 990 1000 1010 1020 1030 1040 

inputs CCTGTCAGCACAGCCGGGCCCCACAGTGGCCTCAGGAGAGAACGTGACCCTGCTGTGTCAGTCATGGTGG 



- - -GTCA C AGGAAC CTCTGTGAC C CCCAGC CGGT 

610 620 630 

1050 1060 1070 1080 1090 1100 1110 

inputs CAGTTTGACACTTTCCTTCTGACCAAAGAAGGGGCAGCCCATCCCCCACTGCGTCTGAGATCAATGTACG 



■TACCAACAGAAC- -- CA--CC7TCX - ■ ■- TAG 

640 G50 



1120 1130 1 140 1150 1*6? V : ' ' IPC 

inputs GAGCTCATAAGTACCA6GCTGAATTCCCCATGAGTCCTGTGAC C ' J J A- ..A^A A .iiAAAPA'ACAGGTG 



GTA- - - GCAGAATTCTC AGAAGCCAC - ' "■■ ■ A- ■ ACTG- - A 

660 670 680 690 



FIG.3B 
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1190 1200 1210 1220 1230 1240 1250 

inputs CTACGGCTCATACAGCTCCAACCCCCACCTGCTGTCTTTCCCCAGTGAGCCCCTGGAACTCATGGTCTCA 



C - - CGTCTCATTCA - - - CAAAC AAAGTCTT • - CAC AA CTGAGACT TCT - - 

700 710 720 730 

1260 1270 1280 1290 1300 1310 1320 

inputs GGACACTCTGGAGGCTCCAGCCTCCCACCCACAGGGCCGCCCTCCACACCTGGTCTGGGAAGATACCTGG 



AGGAGTATC - - ACCACCAGTCC AAAGGA - - GTCAGACTCTCCAG - -CTGG 

740 750 760 770 

1330 1340 1350 1360 1370 1380 1390 

inputs AGGTTTTGATTGGGGTCTCGGTGGCCTTCGTCCTGCTGCTCTTCCTCCTCCTCTTCCTCCTCCTCCGACG 



TCCTGC CCGCCAGTA CTAC ACCAAGG 

780 790 800 

1400 1410 1420 1430 1440 1450 1460 

inputs TC AGC GTC AC AGC AAACAC AGGAC ATCTGAC C AGAGAAAGACTGATTTCCAGC GTCCTGC AGGGGCTGCG 



GCAAC CTGGTC - - CGGATAT - - - GCCTC GGGGCTG - - 

810 820 830 

1470 1480 1490 1500 1510 1520 1530 

inputs GAGACAGAGCCCAAGGACAGGGGCCTGCTGAGGAGGTCCAGCCCAGCTGCTGACGTCCAGGAAGAAAACC 



TGATCCTAATAA TCCTG - - GCGGGGTTTCTG GCAGA - GGACTGG C 

840 850 860 870 

1540 1550 1560 1570 1580 1590 1600 

i nputs TCTATGCTGCCGTGAAGGAC ACACAGTCTGAGG - ACAGGGTGGAGCTGGACAGT - C AGAGC C C AC ACG AT 



AC AGCCG - - GAGGAAGCGC - - -CTGCGGCACAGGG GCAGGGCTGTGCAGAGGCCGCT 

880 890 900 910 920 

1610 1620 1630 1640 1650 1660 1670 

inputs GAAGACCCCC AGGCAGTGACGTATGCCCCGGTGAAAC ACTCCAGTCCTAGGAGAGAAATGGCCTCTCCTC 



TCC GCCCCTG ---CCGC C 

930 940 

1680 1690 1700 1710 1720 1730 1740 

inputs CCTCCTCACTGTCTGGGGAATTCCTGGACACAAAGGACAGACAGGTGGAAGAGGACAGGCAGATGGACAC 



CCTCC-CGCAGAC CCGGAAATCA CA- -CGGG GGTCAGG- - -ATGGA- - • 

950 960 970 980 

1750 1760 1770 1780 1790 1800 1810 

inputs TGAGGCTGCTGCATCTGAAGCCTCCCAGGATGTGACCTACGCCC AGCTGCACAGCTTGACCCTTAGACGG 



— GGC CGAC AGGATGTT - CAC AGC CG- 

990 1000 

1820 1830 1840 1850 1850 1870 1880 

inputs AAGGCAACTGAGCCTCCTCCATCCCAGGAAGGGGAACCTCCAGCTGAGCCCAGCATCTACGCCACTCTGG 



■ CGGGTTATG TTCA- 

1010 



1890 
inputs CCATCCAC 



FIG.3C 
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10 20 30 40 50 60 
inputs MSPSPTALFCLGLCLG-RVPAQSGPLPKPSLQALPSSLVPLEKPVTLRCQGPPGVDLYRLEKLSSS 

MTPALTALLCLGLSLGPRTRVQAGPFPKPTL^ 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

inputs RYQ DQAVLFIPAMKRSLAGRYRCSYQNGSLWSLPSDQLELVATGVFAKPSLSAQPGPAVSSGGDV 



rnnplepkn^rfsipsmtehhagryrchW 

80 90 100 110 120 130 140 



inputs TLQCQT RY- 



tlrcgsqkgyhhfvlmkegehqlprtldsqqlhsggfqalfpvgpvnpshrwrftcyyyymntpqvwshp 

150 160 170 180 190 200 210 

140 150 

inputs GFDQFALYKEGDP- 



SDPLEILPSGVSRKPSLLTLQGPVLAPGQSLTLQCGSDVGYDRFVLYKEGERDFLQRPGQQPQAGLSQAN 
220 230 240 250 260 270 280 

160 

inputs APYK NP ERW-- 

FTLGPVSPSHGGQYRCYGAHNLSSEWSAPSDPLNILMAGQIYDTVSLSAQPGPTVASGENVTLLCQSWK'Q 
290 300 310 320 330 340 350 

170 180 190 200 

inputs YRASFPIITVTAAHSGTYRCYSFSSRDPYLWSAPSDPLELVVTG 

FDTFLLTKEGMHPPLRLRSMYGAHKYQAEFPMSPVTSAHAGTYRCYGSYSSNP^ 

360 370 380 390 400 410 420 

210 220 230 240 250 260 

ino-jts TSVTPSRLPTEPPSS- -VAEFSEATAELTVSFTNKVF - - -TTETSRSITTSPKESD- -SPAGPA- 



HSGGSSLPPTGPPSTPGLGRYLEVLIGVSVAFVLLLFLLLFLLLRRQRHSKHRTSDQRKTDFQRPAGAAE 
430 440 450 460 470 480 490 

270 280 290 
inputs RQYYTKGNLVRICLGAVIL IILAGFLAEDW '- - - HSRRKR 



TEPKDRGLLRRSSPAADVQEENLYAAVKDTQSEDRVELDSQSPHDEDPQAVTYAPVKHSSPRREMASPPS 

500 510 520 530 540 550 560 

.ttG 310 32(1 330 
: RHRGRAVL - ■ R-A ■ ■ PPLPPLPQTRK- - • SHGGQDGGRQDVHSRGLC 

; ". :^:'.:;R:V:;:::DR;;f'i; r EAAA3-A^QDVT v A0LHSl_T_RRKATE 

•; 7 0 530 5-HO 500 610 520 630 
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*- >GesvtLtCsvsgf gppgvsvtWyf kngk . 1 gpsl 1 gysysrl esgek 
+ vtL+C+ + v y + k ++ r++ + 

hT268 41 EKPVTLRCQGP PGVDLY-RLEK1 SSS RYQDQ- - 70 

anl segrfsi ssl tLti ssvekeDsGtYtCvv<-* 
++L i +++ +G Y+C 
hT268 71 AVLFIPAMKRSLAGRYRCSY 90 



FIG.5A 



hT268 
hT268 



*- >Gesvtl_tCsvsgfgppgvsvtWyf kngk . 1 gpsl 1 gysysrl esgek 
G++vtl_+C+++ + ++ y k+g++ + y+++ 
127 GGDVTLQCQTR - - - YGFDQFALY - KEGDpAP YKNPERWYR - - 162 



an! segrfsi ssl tLti ssvekeDsGtYtCvv<- * 
++++i++v++ sGtY+C 
163 ASFP I ITVTAAHSGTYRCYS 182 



FIG.5B 
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A 1163 
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Cys 

Hgly 

Out 




1 41 81 121 161 201 241 281 



MSPASPTFFC IGLCVLQV I QTQSGPLPKPSLQAQPSSLVPLGQSV I LRCQGPPDVDLYRL 
EKLKPEKYEDQDFLF IPTMERSNAGRYRCSYQNGSHWSLPSDQLEL I ATGVYAKPSLSAH 
PSSAVPQGRDVTLKCQSPYSFDEFVLYKEGDTGPYKRPEKWYRANFP 1 1 TVTAAHSGTYR 
CYSFSSSSPYLWSAPSDPLVL WTGLSATPSQVPTEESFPVTESSRRPS I LPTNK I STTE 
KPMN I TASPEGLSPP IGFAHQHYAKGNLVR I CLG AT 1 1 1 ILLGLLAEDWHSRKKCLQHRM 
RALQRPLPPLPLA 
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10 20 30 40 50 60 70 

inputs ATGACGCCCGCCCTCACAGCCCTGCTCTGCCTTGGGCTGAGTCTGGGCCCCAGGACCCGCGTGCAGGCAG 



ATGTCTCCAGCC - TCAC - - CC ACTTTCTT - - - CTGTAT 

10 20 30 

80 90 100 110 120 130 140 

inputs GGCCCTTCCCCAAACCCACCCTCTGGGCTGAGCCAGGCTCTGTGATCAGCTGGGGGAGCCCCGTGACCAT 



TGGGCTG TGTGTACTGC - 

40 

150 160 170 180 190 200 210 

inputs CTGGTGTCAGGGGAGCCTGGAGGCCCAGGAGTACCGACTGGATAAAGAGGGAAGCCCAGAGCCCTTGGAC 



AAGTGATCC AAACACAGAG TGG - - 

50 60 70 

220 230 240 250 260 270 280 

inputs AGAAATAACCCACTGGAACCCAAGAACAAGGCCAGATTCTCCATCCCATCCATGACAGAGCACCATGCGG 



CCCACT CCC CAAG CCTTCCC - TCCAGG 

80 90 

290 300 310 320 330 340 350 

inputs GGAGATACCGCTGCCACTATTACAGCTCTGCAGGCTGGTCAGAGCCCAGCGACCCCCTGGAGCTGGTGAT 



CTCAGCC CAGTTCCCTG - GTACCCCTGGGTCAG 

100 110 120 

360 370 380 390 400 410 420 

i nputs GACAGGATTCTACAACAAACCCACCCT CTCAGCCCTGCCCAGCCCTGTGGTGGCCTCAGGGGGGAATATG 



- TCAG - - TTATTC TGAGGTG - C - - CAGGGA 

130 140 150 

430 440 450 460 470 480 

i nputs ACCCTCC - GATGTGGCTCACAGAAGGGATATCACCATTTTGTTCTGATGAAGGAAGGAGAACACCAGCTC 



- - CCTCCAGATGTGG A'Hl ATATCGCCT^iASAAAC'T GAAA 

160 17( \h-: \--* ! > 

490 500 510 : .,-l0 

inputs CCCCGGACCCTGGACTCACAGCAGCTC CACAGTGGGGGG < : GCAGGCCC TG " TCCCTGTGGGCCCCGTGA 



■ CCGGA GA AGTATGAAGATCAAGAC - - - TTTCTCTT - - • - -- CATT - 

200 210 220 

run OA 
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560 570 580 590 600 610 620 

inputs ACCCCAGCCACAGGTGGAGGTTCACATGCTATTACTATTATATGAACACCCCCCAGGTGTGGTCCCACCC 

- - -CCAACCATGGAAAGAAGTA- - - ATGCT GGAC GGTAT 

230 240 250 260 

630 640 650 660 670 680 690 

inputs CAGTGACCCCCTGGAGATTCTGCCCTCAGGCGTGTCTAGGAAGCCCTCCCTCCTGACCCTGCAGGGCCCT 

CGATG- - - CTCTTA TCAGA ATGGGAGTC - ACTGGTCTCT 

270 280 290 

700 710 720 730 740 750 760 

inputs GTCCTGGCCCCTGGGCAGAGCCTGACCCTCCAGTGTGGCTCTGATGTCGGCTACGACAGATTTGTTCTGT 

CCCAAG TGACC AGCTTGAG CTAATT - - - GCTAC 

300 310 320 

770 780 790 800 810 820 830 

inputs ATAAGGAGGGGGAACGTGACTTCCTCCAGCGCCCTGGCCAGCAGCCCCAGGCTGGGCTCTCCCAGGCCAA 



- - - AGGTGTGTATGCTAAAC - -CCTC - ACTCTC 

330 340 350 

840 850 860 870 880 890 900 

inputs CTTCACCCTGGGCCCTGTGAGCCCCTCCCACGGGGGCCAGTACAGGTGCTATGGTGCACACAACCTCTCC 

AGCTCATCCCA GCT 

360 

910 920 930 940 950 960 970 

inputs TCCGAGTGGTCGGCCCCCAGCGACCCCCTGAACATCCTGATGGCAGGACAGATCTATGACACCGTCTCCC 

CAGCAGTCCC TC- • -AAGGCAGG- - - GAT - - GTGACTCTGA 

370 380 390 400 

980 990 1000 1010 1020 1030 1040 

inputs TGTCAGCACAGCCGGGCCCCACAGTGGCCTCAGGAGAGAACGTGACCCTGCTGTGTCAGTCATGGTGGCA 

AGT GCCAGAGCCCATA 

410 

1050 1060 1070 1080 1090 1100 1110 

inputs GTTTGACACTTTCCTTCTGACCAAAGAAGGGGCAGCCCATCCCCCACTGCGTCTGAGATCAATGTACGGA 



- ATTCGTTCTATACAAAGMGGGG AT ACTGGGCCTTATA - - AGAGACCTGA 

430 440 450 460 470 



CAGTTTTGATGA- - 
420 
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1120 1130 1140 1150 1160 1170 1180 

inputs GCTCATAAGTACCAGGCTGAATTCCCCATGAGTCCTGTGACCTCAGCCCACGCGGGGACCTACAGGTGCT 



G - - AAATGGTACCGGGCCAATTTCCCCATCATCACAGTGACTGCTGCTCACAGTGGGACGTACCGGTGTT 
480 490 500 510 520 530 540 

1190 1200 1210 1220 1230 1240 1250 

inputs ACGGCTCATACAGCTCCAACCCCCACCTGCTGTCTTTCCCCAGTGAGCCCCTGGAACTCATGGTCTCAGG 



ACAGCTTCTCCAGCTCATCTCCATACCTGTGGTCAGCCCCGAGTGACCCTCTAGTGCTTGTGGTTACTGG 
550 560 570 580 590 600 610 

1260 1270 1280 1290 1300 1310 1320 

inputs ACACTCTGGAGGCTCCAGCCTCCCACCCACAGGGCCGCCCTCCACACCTGGTCTGGGAAGATACCTGGAG 



ACTCTCTG CCA- - CTCCCAGCC - - AGGT - -ACCCAC -GGA-AGAATCATTTCCTG- - - 

620 630 640 650 660 

1330 1340 1350 1360 1370 1380 1390 

inputs GTTTTGATTGGGGTCTCGGTGGCCTTCGTCCTGCTGCTCTTCCTCCTCCTCTTCCTCCTCCTCCGACGTC 



TGA CAGAATCCT C C AGGAGAC CTTCC A TCTTAC CCAC AAACAAA 

670 680 690 700 

1400 1410 1420 1430 1440 1450 1460 

inputs AGCGTCACAGCAAACACAGGACATCTGACCAGAGAAAGACTGATTTCCAGCGTCCTGCAGGGGCTGCGGA 



A - - - TATCTACAA - - - CTGAA AAGCCTATGAATATC - - ACTGCCT - C - TCC AG - AGGGGCTG 

710 720 730 740 750 

1470 1480 1490 1500 1510 1520 1530 

inputs GACAGAGCCCAAGGACAGGGGCCTGCTGAGGAGGTCCAGCCCAGCTGCTGACGTCCAGGAAGAAAACCTC 



AGCCCT CC AATTGGTTTTGCTCATCAGCA C 

760 770 780 

1540 1550 1560 1570 1580 1590 1600 

inputs TATGCTGCCGTGAAGGACACACAGTCTGAGGACAGGGTGGAGCTGGACAGTCAGAGCCCACACGATGAAG 



TATGC CAAGGGGAATCTGGTC CGGATATG 

790 800 810 

1610 1620 1630 1640 1650 1660 1670 

acc:c:aggca9tgacgtatgccccggtgaaacactccagtcctaggagagaaatggcctctcctccctc 



- : : ' "GG - TGC C AC GAT - - - T AT AATAATTTTGT 

<3?0 H30 840 

t>80 1690 1700 1710 1720 1730 1740 

:s :t::actgtctggggaattcctggacacaaaggacagacaggtggaagaggacaggcagatggacactgag 



GGCTT- -CTAG- - -CAGAGGATTGGC- - -ACAGTCGGAAGAA AT 

350 860 870 880 



FIG.8C 
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1750 1760 1770 1780 1790 1800 1810 

nputs GCTGCTGCATCTGAAGCCTCCCAGGATGTGACCTACGCCCAGCTGCACAGCTTGACCCTTAGACGGAAGG 

GC - - CTGCAACA CAGGATGAGA GCTTTGC - AAAGG 

890 900 910 

1820 1830 1840 1850 1860 1870 1880 

nputs CAACTGAGCCTCCTCCATCCCAGGAAGGGGAACCTCCAGCTGAGCCCAGCATCTACGCCACTCTGGCCAT 

CCACTA CCACC CCTCC CACTGGCC - - 

920 930 

1890 
nputs CCAC 
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10 20 30 40 50 60 

inputs MSPASPTFFCIGLCVLQVIQTQSGPLPKPSLQAQPSSLVPLGQSVILRCQGPPDVDLYRLEKL-KPEKYE 

MTPALTALLCLGLSLGPRTRVQAGPFPKPTLWAEPGSVISW 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

inputs DQDFL F- IPTMERSNAGRYRCSYQNGSHWSLPSDQLELIATGVYAKPSLSAHPSSAVPQGRDV 



RNNPLEPKNKARFSIPSMTEHHAGRYRCHYYSSAGWSEPSDPLELVMTGFYNKPTLSALPSPVVASGGNM 
80 90 100 110 120 130 140 



inputs TLKC- -QSPY- 



TLRCGSQKGYHHFVLMKEGEHQLPRTLDSQQLHSGGFQALFPVGPVNPSHRWRFTCYYYYMNTPQVWSHP 
150 160 170 180 190 200 210 

140 150 

inputs SFDEFVLYKEGD 



SDPLEILPSGVSRKPSLLTLQGPVLAPGQSLTLQCGSDVGYDRFVLYKEGERDFLQRPGQQPQAGLSQAN 
220 230 240 250 260 270 280 

160 

inputs TGPYK RP EKW-- 

FTLGPVSPSHGGQYRCYGAHNLSSEWSAPSDPLNILMAGQIYDTVSLSAQPGPTVASGENVTLLCQSWWQ 
290 300 310 320 330 340 350 

170 180 190 200 

i nputs YRANFPI ITVTAAHSGTYRCYSFSSSSPYLWSAPSDPLVLVVTG 



FDTFLLTKEGAAHPPLRLRSMYGAHKYQAEFPMSPVTSAHAGTYRCYGSYSSNPHLLSFPSEPLELMVSG 
360 370 380 390 400 410 420 

210 220 
inputs LSATPSQVPTEES FPV - - 



HSGGSSLPPTGPPSTPGLGRYLEVLIGVSVAFVLLLFLLLFLLLRRQRHSKHRTSDQRKTDFQRPAGAAE 
430 440 450 460 470 480 490 

230 240 250 260 270 

input TESS RRPS ILPTNKISTTEKPMNI -TASPEGLSP-PIGFAH- -QHYAKGNLVR- - 1 



TEPKDRGLLRRSSPAADVQEENLYAAVKDTQSEDRVELDSQSPHDEDPQAVTYAPVKHSSPRREMASPPS 

500 510 520 530 540 550 560 

280 290 300 310 
i nputs CLGATI 1 1 ILLGLLAEDWH SRKK(,LCJHRMRALQRPI. PP U'L 



SLSGEFLDTKDRQVEEDRQMDTEAAASEASQDVTYAQLHSLTLRRKATEPPPSQEGEPPAEPSIYATLAI 
570 580 590 600 610 620 630 

inputs A 

FIG. 9 
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*->GesvtLtCsvsgfgppgvsvtWyfkngk. Igpsl Igysysrlesgek 
G+sv L+C+ ++v y + k ++ +++e + 

mT268 42 GQSVILRCQGP PDVDLY-RLEK1KP EKYEDQ-- 71 

an! segrf si ssl tLti ssvekeDsGtYtCvv<- * 
L i + e++++G Y+C 
mT268 72 DFLFIPTMERSNAGRYRCSY 91 

FIG.10A 



*->GesvtLtCsvsgfgppgvsvtWyfkngk. 1 gpsl 1 gysysrlesgek 
G +vtL C++ ++ y k+g++ + Y+r+e + 

mT268 128 GRDVTLKCQSP- - - YSFDEFVLY-KEGDtGP YKRPEKW- Y 162 

an! segrf si ssl tLti ssvekeDsGtYtCvv<-* 
+ ++i++v++ sGtY+C 

mT268 163 RA NFPI ITVTAAHSGTYRCYS 183 



FIG.10B 
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10 



30 



40 



50 



60 



nputs'NSPSPTALFCLGLCLGRV-P^QSGPLPKPSLQALPSCLVPLEKPVTLRCQGPPGVDLYRLEKLSSSRYQD 



70 



mspasptffciglcvWiqt] 3sgplpkpslqagps:lvplggsvilrcqgppdvdlyrleklkpekyed 

20 30 40 50 60 

90 100 110 120 130 



10 
80 



70 



nputs QAVLFIPAMKRSLAGRYRCSYQNGSLWaPSDQLEL'/ATGVFAKPGLSAQPGPAVSSGGDVTLQCQTRYG 



QDFLFI PTMERSNAGRYRCSYQNGSHWSLPILDQLEL I ATGVYAKPSLSAHPSSAVPQGRDVTLKCQSPYS 
80 90 100 110 120 130 140 

14 0 150 160 170 180 190 200 

Wl£ FDQFALYKEGOPAPYKNPERWYRASFP ! flVTAAI'SGTYRCYSFSSRDPYLWSAPSDPLELVVTGTSVTP 



FDEFVLYK EGDTG P YKRP E KW Y R/-.N 
150 160 
21 Q 220 230 
-.putr SRLPTEPPSSVAEFSEATAELTVSFTM'r'V 



:1V~AA!--SGTYRCYCFSSSSPYLWSAPSD?LVLVVTGL5ATP 
70 160 190 2C0 210 
240 250 260 



270 f 



;.R3Itt:P';es:)5pagparqy v T':gnlvriclgavi 



sqvpteesfpvte;sf?.p5ilp - • -TNI IS'.EK-f 

220 230 240 



"AS PES. 3PPIGFAHQHYAI .GNLVRI CLGATI 
L-50 260 270 



280 290 300 310 Sdl 330 
input: LIILAGFLAEDWHSRRKRLRHRGPAVQRPl.PPL^LPQTRFSHGGQDGGRQDVHSRGLCS 



■ 1 1 LLGLLAEDwH"l>kKKCLQHRMF ALQ-, :; L P°lP-..A- 
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